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Raymond S, Lim 
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Ames Research Center, NASA, Moffett Field, Calif. 94035 


SUMMARY 


This paper describes a proposed augmented single-burst-error cor- 
rection system for data stored in the UNICON Laser Memory. In the 
proposed system, a long Fire Code with code length n > 16,768 bits is 
used as an outer code to augment an existing inner shorter Fire Code 
for burst-error corrections. ‘fhe inner Fire Code is a (80,64) code 
shortened from the (630,614) code, and it is used to correct a single- 
burst error on a per-word basis with burst length b £ 6. The outer 
code, with b $ 12, would be userl to correct a single-burst error on a 
per-page basis, where a page consists of 512 32-bit words. In the pro- 
posed system, the encoding and error detection processes would be 
implemented by hardware. A minicomputer, currently used as a UNICON 
memory management processor, would be used on a time~demanding basis for 
error correction. Based upon existing error statistics, this combina- 
tion of an inner code and an outer code would enable the UNICON system 
to obtain a very low erzor rate in spite of flaws affecting the recorded 
data. The approach of the long Fire Code described here is also 
applicable to other mass memory systems (such as rotating disks) where 


single-burst-error correction can be obtained at very low redundancy. 
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INTRODUCTION 


The UNICON laser memory, or UNICON, is a digital memory manufactured 
by Precision instrument Lo. for the ILLIAC-IV computer as part of the 
mass I/O storage. The UNICON as an on-line storage capacity of about 


10! 


bits. In such a high-density data storage system, contamination 
and other defects can easily obliterate a group of data bits. In order 
to operate successfully in spit of this problem in the UNICON, an 
elaborate error-correction syst m is used, 

Error statistics obtained on the operation and checkout of the 
UNICON indicate that the desired net error rate os one bad word of data 
per 2x 10!! bits read might +e met if two Fire Codes were used for 
single-burst-error correction. These two Fire Codes would be arranged 
as a system consisting of an inner code and an outer code. The inner 
code would be a short (80,64) Fire Code used to correct an error burst 
of length b © 6 on a per-word basis. The outer code would be a long 
(16803 ,16768) Fire Code used to correct an error burst of length b = 12 
on a per-page basis. It should be noted that this system of an inner 
code and an outer code would not function in the same sense as the 
concepts of concatenated codes (ref. 1). Furthermore, the Reed- 
Solomon code from GF(2") successfully used in the IBM Photo-Digital 
memory (ref. 2) is not feasible for the UNICON because of the long 
codeword length of n=16803. 

The inner (80,64) Fire Code currently in use is implemented all in 
hardware by shift registers and error correction 1s done on-line as 


data are read off the strip. The (16803,15768) outer Fire Code would 


be decoded in a manner suggested by Chien (ref. 3). A similar scheme 
for burst error correction with b < 11 was implemented successfully in 
the IBM model 3830 controller for the Model 3330 disk. The problem of 
effectively implementing the (16803,16768) outer code has been solved 
by a number o: innovations and in a hybrid manner. Most important is 
the use of hardware for encoding, while using a minicomputer on a time- 
demanding basis for error correction, The minicomputer performs a 
calculation to find the burst error location i by applying two well- 
known theorems from Number Theory: the Euclidean algorithm for 
division and the Chinese remainder theorem for simultaneous congruences, 
sometimes thought to be me, ly of academic interest only to Number 


Theorists. 
CODING REQUIREMENT FOR UNICON 


The basic storage element of the UNICON is a metal coated plastic 
strip about 4.75 x 31 in. The recording density across the strip is 
11805 tracks over 3.8 in., or about 3000 tracks per in. Each track 
can store up to eight records of data, with each record containing 
16,768 data bits plus other record identification bits. The method for 
recording data on the strip is tw use the laser to either burn a hole 
for a l-bit, or not to burn a hole for a O-bit. 

Data entering and leaving the UNICON are handled by a UNICON 
Controller (ref. 4). The Controller has two interfaces: a central 
memory (CM) interface for data, and a minicomputer interface for 
control. The minicomputer (a PDP-11) functions as the UNICON memory 


management processor (UMP). The UMP obtains its control programs 


through its own virtua’ hardware interface into the CM. By means of 
this virtual address space, UMP can address up to 4096 pages in CM, 
Data transfer to the UNICON is always one page of data at a time from 

CM, where a page of data consists of 512WX32B, which is 16,384 bits. 

Por each page of data, the controller attaches 16WX16B of header in- 
formation and 8WX16B of checksum parity bits to form a data record of 
16,768 bits. The checksum is simply a single-parity check with check 
symbols from Gr (2! 78) , 

The basic word length in the UNICON is 64 bits, As previously 
mentiored, the UNICON currently has a built-in (80,64) Fire Code which 
is used to correct a single-burst error with burst length b < 6 in a 
per-word-basis. Because of optic imperfections and other mechanical 
flaws, data read back from the strip always contain errors. When the 
UNICON is properly aligned, most errors are correctable by the (80,64) 
Fire Code. Preliminary observations on the error statistic indicate 
that the probability of getting an error not correctable t 5 Fire 
Code is about one for every 1o!! bits of data transferred, wiich is about 
one error per strip. The observed uncorrectable errors, for those that 
are understood and the cause can be explained, are typically single- 
burst errors with b £ 11. Thus if a long Fire Code with b * 12 is used 
to augment the existing (80,64) Fire Code to correct a single-burst- 
error on a per-page basis, then it is possible that the error rate can 
be reduced to a very low level. The long Fire Code chosen is a 


(16803,16758) code. In this arrangement, the (16803,16768) code is 


called the outer code, while the (80,64) code is called the inner code. 


DESCRIPTION OF THE FIRE CODE 


An error burst of length b is defined as a vector whose non-zero 
components are confined to b consecutive bit positions, the first and 
last of which are non-zero. It is known that cyclic codes for single- 
burst-error correction can be systematically constructed (refs. 5 and 6). 
These codes are rather efficient and they can be implemented with 
feedback shift registers. An important class of these codes is known as 
the Fire Codes (ref. 7). In the discussion which follows, let 

n = length of codeword in bits. 

k = number of message bits in a codeword. 

n-k = number of redundant bits in a codeword. 

b = maximum length of single error burst which can be corrected, 

in bits. 

m(x) = message word. 

v(x) = transmitted codeword. 

E(x) = error burst. 

R(x) = v(x) + E(x) = received codeword, 

It is clear that for a given k and b, the objective is to construct 
an (n,k) code with as small a redundancy n-k as possible. 

It is also known from che Reiger bound (ref. 8) that the number of 
parity check bits of a b-burst-error correcting code is 

n-k 2 vb (1) 
This is an upper bound, and codes which meet the Reiger bound are said 


to be optima'. The ratio 


= 2b f 
t* ak ad 
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is a measure of the burst-correcting efficiency of the code. An 
optimal code has Z=1, The fire code at best has 

3b-1 * 3 7 
which is not very optimal with respect te the Reiger bound. 

The Fire Code is a linear cyclic co‘'e which can be systematically 
constructed for correcting a single Lurst of error in a codeword of n 
bits. The constructed codeword is in systematic form; that is, the 
n-k parity check bits are simply concatenated after the message bits. 
A Fire Code with code symbols from GF(2) which is capable of 
correcting any burst of length b or less and of simultaneously 
detecting any burst of length d= b is best described by its generator 


polynomial 
g(x) = p(x) (x* + 1) (4) 


p(x) = an irreducible polynomial of degree m whose roots hive 
order e; that is, the period of p(x) is e. 
c 2bed-1 


d 2b 
m 2b 
(c,e) = 1, that is, c and e are relatively prime. 


For pure error correction purpose, the best choice is to set m=b and 
d=b. This results in a Fire Code with the following parameters: 
n = LCM (e,c) = ec 


n-k =c +m = 3b - 1 


jee 


k = ec - (3b - 1) 


ils 


corer 


Another version of the Fire Code which is the ha'f-length version, has 


the following code paraments: 


ne S) e, where c is an even integer. 


n-k = 3b - 2 
k = (5) e - (3b - 2) 


In the conventional decoder implemented by shift registers, 
decoding the Fire Code requires two n shifts, n shifts for parity 
checking and another n_ shifcs for error correction. As the first 
n shifts attributed to parity checking are concurrent with the reading 
r vation, no time delay occurs in this operation. The second n 
shifts needed to locate the error burst and correct it is a time delay 
due to error correction. If n is very large, like 16768 bits, then 
this long time delay may be intolerable in real life operation. 
However, if there exists within easy reach some power for general 
computation, resided either within the host computer or within the 
controller of the storage system, then the second n_ shifts can be 


reduced to the minimum of (e+c-2) plus computation time. 
THE (80,64) FIRE CODE AND ITS IMPLEMENTATION 
The generator polynomial for the (80,64) Fire Code is 

g(x) = (x9 +x ed) O41) (5) 


In this case, p(x) = (x® + x +1) is primitive so the order of 


its roots i3 


Te ae ee (6) 


The code parameters for this code are as follows: 


(63,10) = 1, thit is, 63 anc 10 are relatively prime. 
n= ec = 63 x 10 = 630 
n-k=e=c+mes 10 +628 16 
k = 630 - 16 - 614 


6 so that b = 6 


c= l0=bed-i1 
=6+d-1 so thatd=5 
Note that the condition d 2 b is not met. This means that this code 
probably will not be able to correct all possible single-burst errors 
of length b ¢ 6. 

From the above calculations, the code generated by g(x) in 
equation (S) is a (630,614) code. Since the word length of the UNICON 
is 64 bits, the code must be shortened to a (80,64) code by making the 
(614 - 64) = 550 most significant bits of the message equal to zero and 
omitting them, This process will not affect the encoding and parity 
checking calculation since leading zeros will not affect them. It does, 
however, affect the error correction procedure since the unaltered 
procedure would require 550 shiits corresponding to the 550 omitted 
zeros before reading the actual received codeword out of the buffer. 
Instead of making the buffer wait for 550 shifts, the standard 
procedure is to premultiply the decoding shift register by 4°50 
modulo g(x). 


The parity-check calculation, as it stands, is the residue of 


x16 R(x) modulo g(x). An additional automatic multiplication by 


550 


566 


x is desired; that is, the residue X is desired. With the 


help of a program written in basic, the remainder r(x) after 


566 


dividing X by g(x) is found to be 


x)? ‘ xi! 6 3 2 


r(x) = (x!5 + ix 2 +8: + 


This basic program is shown in figure 1. The encoding shift register 


is shown in figure 2, and the decoding shift register is shown in 


figure 3. 
THE (16803,1676{, ?IRE CODE AND ITS IMPLEMENTATION 


The objective of this code is to have k = 16768 and b = 12. 
The procedure to find a Fire Code that meets this requirement is as 
follows: 
1. Letm=d=be#i12 and c# 2b-1 = 23 
so that c + m= 23 + 12 = 35 
2. g(x) = p(x) (x75 + 1) 


~n., k+ (n-k) _ 16803 
% £23 . “y= «515+ 


Thus p(x) has degree m= 12, and a period e 2 514 is 
desired. With some looking, the trinomial cx? + > +1) is 
not primitive but has a period e = 819 is acceptable. Thus 
g(x) = (x07 + x9 1x + 1), 
Now that g(x) is found, the code parameters are: 
n=e = 819 x 23 = 18837 
n-k =c +m = 23 + le = 35 


k = (18837 - 35) = 18802 : 


The result is a (18837,18802) code, and this code can be shortened to 
(16803 ,16768) by omitting 2034 leading zeros. 

The error correction part of the decoding requires n = 16805 
shifts, which is too long a time delay for practical application, 
However, there exists a high-speed decoder for this code suggested by 
Chien (ref. 3). This high-speed decoder consists of two feedback shift 
registers, one based on the factor (x©+1) and the other based on the 
factor p(x). The two registers are run in synchronism for parity 
checking and error detection. A general diagram of this decoder is 
shown in figure 4. In this decoder, the syndrome is presented by the 
remainder of (x41) and p(x), and the error pattcri is identified 
when it has the same representation in both registers. The operation 
of this decoder is as follows: 

1. Shift both registers n times to enter the received 

codeword R(x). 

2. If R(x) has no error, the syndrome in both registers is 

zero. 

3. If the syndrome is not zero, shift only the (x*+1) register 

until its (b-1) high-order bit positions equal to zero. 
The error pattern is contained in the b iow-order bit 
positions. The maximum number of shifts is (c-1). Freeze 
this register. 

4. Next shift the p(x) register until its contain is matched 

with the error pattern in the (x©+1) register. The maximum 
number of shifts is (e-l). If a correctable error occurs, 


this match condition will occur, otherwise ar uncorrectable 


error is detected. 


From the above description, it is noted that the maximum decoding time 
is equal to the minimum of (e+c-2), as compared to n_ for the con- 
ventional decoder. 

Suppose a correctable burst of length <b at location 


i, o ¢ i <n, has occurred, then the error can be written as 


E(x) = x? B(x) (8) 

The remainders calculated in the registers are 
$(x) = x* B(a) modulo p(x) (9a) 
S_(x) = x* B(x) modulo (x°+1) (9b) 


It is noted that the syndrome S.(x) is capable of determining the 
burst Lacicern and its location up to a multiple of c. Therefore, by 
saifting the syndrome S$. (x) in its register and testing for zero at 
the (b-1) high-order bit positions, the error pattern and its location 
i can be determined quickly up to multiple of c. That is, 
iz r. modulo c (10a) 
where o * e, 2 ¢s Now that the error pattern is determined, the p(x) 
register can be shifted until a match is obtained. This gives 
iz T mouulo e (10b) 
If i can be solved in equation (10), then the decoding is complete. 
The preblem in equation (10) is a classical problem of simultaneous 
congruence in Number Theory. If e and c are relatively prime, then 
the Chinese Remainder Theorem can be used to solve for i. 
The Chinese Remainder Theorem simply states that 
"Numbers which satisfy the simultaneous congruence i = a, 
(mod 


(mod Uy)>- 1 Sp Dies jexist if the Ms are relatively 


j j 


nit. 


prime in pairs, and such numbers constitute a single number 
class modulo 1 Wye 
1 
Now to solve for i. According to the Euclidean division algorithn, 


if (e,c) = 1, there exists intergers A. and Ay such that 


A, e+ A, c=} modulo (n = ec) (11) 
Multiply both sides by i, the result is 

i= (A, e) i+ (A, ec) i modulo n (12) 
From the shift registers, note that 

i- q. ¢ +f, for (x° + 1) register (13a) 

i= % e+ T) for p(x) register (13b) 


By substituting equation (13) into equation (12), the result is 


i= (A. e) %, * (A, c) T) modulo n (14) 
Since 

A, q,e¢ = 0 modulo n (15a) 

Ay 4 eczo0 modulo n (15b) 


The key equation is equation (14). The computation of A. and A, can 
be done easily off line by the UMP with the numbers Ae and A,e 
stoved in its memory. Once the error-pattern-match condition is found 

| in the decoder, vr and tr) are available and the UMP can compute i 
by equation (14). Once i is solved, UMP will fetch the appropriate 


words from CM and perform the error correction with the error pattern 


stored in the p(x) register. 
: It is not convenient to describe the detailed implementation of the 
(16803,16768) Fire Code since its code length is too long, 16803 bits. 


However, in order to illustrate the principle of this high-speed decoding 


xt 


Le 


~~ 


scheme, a shorter (35,27) Fire Cod2 with b <3 shortened to a (24,16) 


code is fully illustrated in the Appendix. 


CONCLUSION 

This paper has presented an augmented scheme for single-burst-error 
correction for the UNICON laser memory. The augmentation is the 
addition of an outer code, a (16803,16768) Fire Code, to correct for a 
single error burst with length b < 12 on a per-page basis. This 
(16803,16768) Fire Code is an addition to the existing (80,64) Fire 
Code which has already been implemented in the UNICON to correct a 
single error wurst with length b £6 ina per-word-basis. Based upon 
observed error statistics, this augmented scheme would reduce the 
error rate to about one errvr per 2 x 10! bits of data transfer, which 
is about »>ne error per two strips read. The (16803,16768) code would 
be implemented by a hybrid method using a high-speed hardware decoder 
for syndrome calculation and a minicomputer for error location 
computation and error correction. For the case where the periods of 
the ti. polynomial factors of g(x) are relatively prime, a simple 
application of the Chinese Remainder Theorem would determine the burst 


location, 
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APPENDIX 


The purpose of this appendix is to illustrate the principle of the 
high-speed decoder by implementing a (24,16) Fire Code shortened from 
the (35,27) code. The generator polynomial for this code is: 

g(x) = +x +1) O41) 
where p(x) = a +x = 1) is primitive and hence its order e = 7, 
The parameters of the code are: 

m= bes 3 . d=b=3 

c= 2b-12#5 

neec«7x5=s 35 

n-k=c+mes= 3b-128 

k = n =~ (n-k) = 27 
The computation of A.e and Ase are as follows: 

A. e+ Ay cl mod 35 

7A, + 5A, = 1 mod 35 

A, = 3 and A, = 3 
Thus 

i = (21 =. * 15 T)) mod 35 

Lets shorten the (35,27) code to a (24,16) code by making the leading 
1l message bits equal to zero. Encoding is not affected. However, 
decoding must account for the missing 11 leading zeros. Any multiple 
of C =5 shifts of the (x® + 1) register has no effect. Any 
multiple of e= 7 shifts of the p (x) register has no effect. 
For the (x® + 1) register, 


(3 + 11) - (5 x 2) = 4 shifts 


ae 
oP? = er a 


meaning input data should be connected to the inpv't of the Sth stage 
of the (x® +1) register. For the p(x) register, 
(3 + 11) - (7 x 2) = O shifts 
meaning input data should be connected to the input of the first stage 
of the shift register. The decoder is shown in figure A-1. 
For a message m(x) of 
0 123 456 789 101112 13 1415 
m(x) = 1 O11 O00 101 2 8 00 1 
the encoded V(x) is 
0 123 456 789 101112 13 1415 01 234 567 


! 
| 
1 O11 OOO 101 i ee ee l he oll 101 | 
| 
n-k 


m(x) 


In these examples, bit-0 is the MSB and it is sent first. To illustrate 
the principle of error correction, lets introduce two error patterns as 
follows: 


Error Pattern No. 1 


BO 828 cevcise 9 10 123 12 5 15 OL iscet 
E, (x) = 0 GOO sesvce orm FF. 4 | 0.... 0 | 00....0 
i= 10 
Error Pattern No. 2 
O1ZS 6 SE F BD revcccvesscencsecs 1S 4 Ober? 
E,(x) = 0 000 O [211 [oo Cohen erdnetorenen 0 ! 00....0 
i=5 


The corrections for these two error patterns are shown in figures 


A-2 and A-3, respectively. 
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REM==THIS PROGRAM COMPUTES X(EXPN) MOD G(X) RY LINEAR SHIFT 
REMe=REGISTF® SEQUENCES, 
REM@emINITIALIZATION 
PRINT "NM, 

INPUT N 

LET tet 

OIm x(8@) 

rUR Js@ TO 16 

x(J) 0? 

NEXT J 

KEM@eSHIFT RIGHT SUBT, 
xCL6)exK(18) 
x()5)ex(14) 

O24) 8x43) 
¥(13)ex(12) 

KCL 2)ex(11) 

IF KC16)exX(18) GOTO 63 
xCLides 

GOTO 9@ 

XCLi)8¢ 

IF X¢16)8x(9) GOTO 93 
XC1@) 04 

GOTO 100 

x(12@)00 

x(9)=x(8) 

x(B)28x(7) 

K(7)8X(6) 

IF X¢€16)8xX(5) GOTO 133 
x(6) 81 

GOTO 140 

x(6) 00 

x(5)8X(4) 

x(4)ex(5) 

x(4) 8X2) 

x(2) 8x(1) 

IF x(16)sx(@) GOTO 183 
X(t) 8] 

GOTO 19H 

x(1)9@ 

IF X(16)81 GOTO 193 PRECEDING PAGE BLANK NOT FILMED 
x(U)el 

GOTO 24a 

x¥(@) su 

REM@=@END OF SHIFT RIGHT SUBT, 
LET *%(16)80 

LET [80 

LET NeNney 

IF Noe? GOTO 65 


PRINT "X(@O), XC1)- e XC15)" 


seer 
PRINT K(@)sXCL)9XC2])9KC39 N04) X05) KOE XC7) 
PRINT K(B)9X(9) sKC1A)eKCL 


L)geKCL2VsKCL BI eKCL4d gp KC15) 


PRINT 
GOTO 2@ 
END 
; . 566 
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Figure A-2.2.- p(x) decoding for error pattern No. 1. 
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Figure A-3.1.- (x*+1) decoding for error pattern No. 2. 
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Figure A-3.2.- p(x) decoding for error pattern No. 2. 


